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Abstract 

A  simple  analysis  for  an  adaptive  primal-dual  method  for  linear  program¬ 
ming  is  given.  Starting  from  a  pair  of  primal  and  dual  feasible  points  near  the 
path  of  centers,  the  method  maintains  a  worst-case  complexity  of  0(y/n  log 
iterations  to  reduce  the  initial  duality  gap  by  a  factor  of  c.  In  contrast  to  other 
interior-point  algorithms  that  share  the  same  complexity  ([13,  4]  and  many 
others),  the  algorithm  proposed  here  allows  an  acceleration  of  the  rate  of  con¬ 
vergence  (up  to  a  complexity  of  0(\/n log  *))  if  the  problem  is  “well  behaved”. 

Key  words:  linear  programming,  interior-point  method,  theoretical  com¬ 
plexity,  orthogonal  projection 

1.  INTRODUCTION 

We  discuss  interior-point  algorithms  that  apply  one  iteration  of  Newton’s  method 
to  a  set  of  nonlinear  equations  involving  a  parameter  fi  >  0,  and  then  reduce  fi  by 
a  certain  amount  before  repeating  the  process.  For  fi  =  0  the  solution  of  the  set 
of  nonlinear  equations  coincides  with  the  solution  of  a  given  linear  program.  The 
reduction  of  fi  is  of  the  form  fi  :=  fi(l  —  6)  for  some  “steplength”  6  E  (0, 1),  where  the 
focus  is  on  the  size  of  6.  The  method  of  centers  (e.g.  [13])  is  typical  of  “short-step” 
methods  in  which  6  <  1  fy/n  at  every  iteration,  where  n  is  the  number  of  variables. 
Roughly  speaking,  the  shortness  of  the  steplength  6  <  1/y/n  allows  a  proof  that 
the  Newton  iterates  remain  strictly  feasible,  but  for  numerical  implementations  this 
rate  of  convergence  is  too  slow  when  n  is  large.  A  “long-step”  method  is  one  for 
which  6  >  l/y/n.  (Typically  6  is  a  constant  independent  of  n.) 

In  the  recent  past  a  number  of  investigations  have  been  presented  that  analyze 
long-step  interior-point  methods  for  linear  programming  (see  [3,  6,  11,  14]).  Their 
goal  is  to  generalize  the  existing  proofs  of  convergence  of  certain  interior-point  al¬ 
gorithms  to  wider  classes  of  algorithms  and  to  justify  the  correctness  of  existing 
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implementations  of  interior-point  algorithms  that  yield  very  fast  convergence  for 
(most)  numerical  examples.  In  allowing  long  steps,  however,  the  theoretical  com¬ 
plexity  of  these  investigations  degrades.  This  apparently  paradoxical  behavior  is 
due  to  the  fact  that  for  large  steps  the  Newton  iterates  may  no  longer  be  feasible, 
and  therefore  the  method  has  to  use  damped  Newton  iterations  with  a  damping 
factor  a  €  (0, 1).  Loosely  speaking,  the  iterates  may  not  be  able  to  “follow”  such 
a  long  step  6  in  the  parameter  p.  The  goal  of  the  present  paper  is  to  improve  the 
theoretical  complexity.  We  will  consider  an  adaptive  choice  of  the  steplength  6  such 
that  the  full  Newton  iterate  may  always  be  used. 

By  a  thorough  analysis  of  the  integral  over  the  curvature  of  the  path  of  centers 
it  was  shown  in  [16]  that  for  certain  subclasses  of  linear  programs  the  theoretical 
complexity  could  be  reduced  below  0(y/n  log  | )  iterations,  and  in  [12]  a  probabilistic 
analysis  showed  that  the  “anticipated”  number  of  iterations  could  be  reduced  below 
O(v/nlog|),  but  in  a  general  worst-case  analysis  this  complexity  is  (still)  the  state 
of  the  art.  The  analysis  presented  here  is  particularly  simple.  It  also  illuminates 
why  the  theoretical  complexity  could  not  be  improved  so  far  by  any  analysis  that 
focuses  on  the  worst  case  in  a  single  iteration,  rather  than  examining  a  sequence  of 
iterations. 

In  Sections  2. 1-2.4  we  present  a  proof  of  convergence  for  the  method  of  centers. 
The  result  is  well  known  (see  e.g.  [7]),  but  the  analysis  is  new  and  allows  us  to  explain 
in  Section  3.1  where  the  factor  y/n  comes  from,  and  to  analyze  a  new  adaptive 
method  in  Section  3.2  that  automatically  chooses  large  steps  if  Newton’s  method 
for  finding  the  center  converges  “well”,  and  takes  short  steps  otherwise. 


2.  THE  PROBLEM  AND  A  SIMPLE  METHOD 

The  problem  under  consideration  is  the  linear  program 


mincrx,  S  =  {a;  >  0  |  Ax  =  6},  (P) 

r€S 

where  .4  G  IRm*n,  x  €  lRn,  b  6  lRm .  For  brevity  we  assume  that  the  relative  interior 
of  the  feasible  set  5  is  nonempty  and  bounded,  and  that  the  rows  of  A  are  linearly 
independent.  The  dual  problem  to  (P)  is  given  by 


{& y  i  A*y  ^  c)-  (°) 
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where  p  >  0.  The  necessary  (and  sufficient)  conditions  for  a  stationary  point  of 
are 

F»{x,y,z)-.=  ATy  +  z-c  =  0  ,  x,z>  0,  (2.1) 

\  Xz-pe  )  \0j 

where  X  =  diag(xlt . . . ,i„),  e  =  (l,...,l)Te  lRn,  and  z  is  an  auxiliary  variable. 
(In  the  following  a  small  letter  and  the  same  letter  capitalized — like  x  and  X — 
will  always  stand  for  a  vector  and  its  corresponding  diagonal  matrix.)  The  unique 
solution  x(p),  y(p),  x(p)  of  (2.1)  is  the  analytic  center  as  defined  in  [15].  It  is  obvious 
that  x(p)  is  primal  feasible  and  y(p)  is  dual  feasible.  From  (2.1)  it  immediately 
follows  that  the  duality  gap  between  x  =  x(p)  and  y  =  y(p)  is  bounded  by  n/i: 

nn  =  ||H|i  =  ||**||i  =  \\X(c  -  i45y)||i 

<  xT(c  -  ATy)  =  cTx  -  (Ax)Ty  =  cTx  -  b Ty. 

As  first  proved  in  [9],  x(p)  and  y(p)  converge  to  primal  and  dual  optimal  solutions 
as  n  tends  to  zero.  (See  also  [1].) 

2.2.  The  Newton  Step  for  Finding  the  Center 

Suppose  that  x  >  0,  z  >  0  and  y  are  given  such  that  Ax  =  b  and  ATy  +  z  =  c.  For 
H  >  0  we  define  the  residual 

r  :=  Xz  -  pe, 

so  that  FM(x,y,  z)  =  (0,0,  rT)T.  The  Newton  step  for  finding  the  analytic  center 
x(p),  y(n),  z{n)  is  then  given  by 


AAx  =  0, 

ATAy  +  Az  =  0,  (2.2) 

ZAx  +  X&z  =  -r. 

Note  that  if  we  were  able  to  solve  the  (nonlinear)  system  (2.2')  where  in  the  last 
equation  of  (2.2)  the  term  AXAz  is  added  on  the  left-hand  side,  then  we  could  find 
the  exact  center  as  (x  +  Ax,  y  +  Ay,  z  +  Ax),  since  (2.2')  is  then  equivalent  to  (2.1). 

Define  the  positive  diagonal  matrix  D  by  D2  :=  XZ~l.  Then  (2.2)  can  be  solved 
via 

q  =  DX~lr, 

Ay  =  ( AD*AT)-'ADq ,  (t) 

Ax  =  D2ATAy  -  Dq,  '  ‘  ' 

A  z  =  -D~lq-D~2  Ax. 

This  becomes  obvious  as  we  observe  that  (2.3)  is  equivalent  to  (2.4)-(2.6).  First, 

DATAy  =  II/i9  (2.4) 

(since  DAT  has  maximal  rank),  where  11/?  is  the  orthogonal  projection  onto  the 
range  R(DAT)  =  {y  |  y  =  DATw  for  some  w  6  Rm}  of  DAT.  Let  II/v  denote 
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the  orthogonal  projection  onto  the  null  space  N(AD)  of  AD.  Since  N(AD )  is  the 
orthogonal  complement  to  R(DAT),  i.e.  II/i  +  Ilyv  =  J,  it  follows  that 

Ax  =  D(T[Rq  -  q)  =  -DU^q,  (2.5) 

and  finally, 

Az  =  -D~l(q  +  D~lAx)  =  -D~lnRq.  (2.6) 

It  is  straightforward  to  verify  that  (2.4)-(2.6)  satisfy  (2.2). 

2.3.  Analysis  of  the  Newton  Step 

The  residual  after  executing  the  above  Newton  step  (without  damping1)  is  given  by 

f  =  (X  +  AI)(z  +  Az)  -  \ie  —  Xz  +  XAz  +  ZAx  +  AXAz  -  fie 
=  Xz  -  r  +  AXAz  -  fie  =  AXAz  =  AXAz , 

where  Ax  =  -D~1Ax  =  E^q  and  Az  =  -DAz  =  Il^g.  Note  that 

DX-1  =  >/X-lZ-'  =  (x/fl  +  zi/)'1  (2.8) 

(using  Xz  =  r  +  fie).  From  (2.7)-(2.8)  and  the  definition  of  q  in  (2.3)  we  can  readily 
derive  the  classical  convergence  results  about  the  primal-dual  method  as  given  in  [7] 
or  [14].  Assume  that  x  >  0,z  >  0  and  y  are  given  such  that  we  can  find  a  positive 
/x  for  which 

IMIa  <  ft*  (2-9) 

for  some  0  £  [0,^].  (This  means  we  are  assuming  that  x,  y,  z  is  “moderately  close” 
to  the  path  of  centers.)  It  follows  that 

+  (2.10) 

so  that  _ 

\\qh<P\/n/(l-0)=:0,  (2.11) 

and 

||Ax||2  =  0cos0,  \\Az\\2  =  0smO,  (2.12) 

where  9  is  the  angle  between  q  and  II uq.  Finally  by  (2.7),  (2.12)  and  (2.11), 

||f||2  =  \\AXAz\\2  <  42  cos  9 sin  9  <  ~  ■  (2.13) 

Hence,  the  relative  error  ]|Az  -  /xe|| 2/fi  =  ||r||2//x  is  squared  after  each  iteration  of 
Newton’s  method2. 

‘Analyzing  full  Newton  steps  only  is  not  a  severe  restriction,  since  a  damped  Newton  step  can 
be  viewed  as  a  full  Newton  step  for  a  linearly  perturbed  system  of  equations  (the  original  system 
of  nonlinear  equations  to  which  a  linear  perturbation  is  added). 

*Note  that  (2.13)  and  the  first  equation  in  (2.7)  imply  that  (X  +  AX)(z  +  Az)  >  0.  One  way 
to  verify  strict  feasibility  (i.e.  z  +  Ax  >  0  and  z  +  Az  >  0)  is  to  show  that  ||X-1  Az||2  <  1.  By 
(2.12),  ||A'*||  <  0,  and  so  pr1 A*||,  =  HX^ZJa'xIIj  =  IKv/ff+TJ)"1  A**||2  <  -  0)  <  1. 
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2.4.  A  Model  Algorithm 

The  above  analysis  allows  us  to  formulate  the  following  classical  model  algorithm. 
Let  x°  >  0,  y°,  z°  >  0  and  /x°  be  given  such  that  Ax°  =  b,  ATy°+z°  =  c,  X°z°-n°e  = 
r°  and  ||r°||2//i°  <  (Again  this  assumes  that  the  initial  point  is  moderately  close 
to  the  path  of  centers.)  Further,  let  some  desired  accuracy  e  be  given.  Set  k  =  0. 

1.  Perform  one  Newton  step  via  (2.2)  to  obtain  xk+1,  yk+1 ,  zk+1. 

2.  Decrease  fik  to  /x*+1  :=  fik(  1  —  j^j). 

3.  Set  k  =  k  +  1. 

4.  If  nk  <  t/n  then  stop,  else  go  to  Step  1. 

Since  ||r*+1||2  <  ||ffc||2  +  ^j||e||2  <  4  +  <  2  (with  rk  as  in  (2.13))  it  follows 

by  induction  that  all  residuals  rk  fulfill  (2.9)  with  f3  =  and  hence  (by  the  results 
of  Section  2.2)  the  method  is  well  defined.  Since  the  complementarity  parameter 
fi  converges  linearly  to  zero  (Step  2),  the  method  terminates  after  0(y/n log2^-) 
iterations.  This  result  is  well  known  [7]. 


3.  IMPROVEMENTS 

We  shall  now  examine  possible  improvements  of  the  convergence  analysis.  For  this 
purpose,  it  is  useful  to  analyze  which  of  the  estimates  leading  to  (2.13)  are  sharp, 
and  which  ones  can  possibly  be  tightened. 

Note  that  the  relations  (2.7),  (2.8)  and  (2.12)  are  exact.  However  the  bounds 
(2.10)  and  (2.11)  could  also  be  established  under  the  weaker  assumption 

IMIoo  <  (2.9') 

(It  is  interesting  to  note  that  for  any  feasible  x,  y,  z  one  can  always  find  some  /x  >  0 
such  that  (2.9')  is  satisfied  with  /?  =  1.)  Also,  the  bound  (2.13)  is  based  on  the 
inequality  ||S$||2  <  ||s|]2|jt|j2,  which  also  holds  in  terms  of  the  infinity  norm:  ||5t||oo  < 
IMIoolMloo-  _  Finally,  to  guarantee  feasibility  of  x  +  A x,z  +  Ax,  the  inequalities 
||Ax||oo,  ||Ax||oo  <  4  would  suffice  in  place  of  (2.12)3.  Unfortunately,  an  estimate 
corresponding  to  (2.12)  in  terms  of  the  infinity  norm  only  holds  in  the  form 

|| Ax||oo  <  y/nficoaO,  UAzHoo  <  y/n(3&in0.  (2.12') 

We  “lose”  a  factor  of  y/n  both  times.  Stating  (2.13)  in  terms  of  the  infinity  norm 
would  thus  “lose”  a  factor  of  n. 

3  An  analysis  in  terms  of  the  infinity  norm  would  be  particularly  interesting,  since  by  the  update 
in  Step  2  of  the  model  algorithm  the  residual  r*+l  is  obtained  from  rk  by  r*+1  =  fk  +(//  —  /ifc+,)e, 
and  the  infinity  norm  of  t  is  smaller  than  its  2  norm:  IMIoo  =  ^-||e||a. 
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3.1.  Understanding  the  Analysis 

The  purpose  of  the  following  analysis  is  to  shed  some  light  on  this  apparent  “in¬ 
compatibility”  of  norms.  Assume  for  the  moment  that  for  some  ft  =  fik  the  exact 
center  x,y,z  is  given.  (A  “central”  analysis  is  particularly  interesting  since  it  has 
been  observed  (see  e.g.  [2])  that  at  the  center  all  the  search  directions  suggested  for 
interior- point  algorithms  since  Karmarkar  [5]  are  equivalent.)  At  the  center,  D,  r 
and  q  reduce  to 

D2  =  XZ~l  -  —  X2,  r  =  (fik  -  nk+1)e  =:  Sfie,  q  =  6y/Jie,  (3.1) 

where  6  €  (0,1).  Using  the  Newton  step  for  finding  x(nk+1),y(/ik+l),z(nk+1)  via 
(2.2)  we  obtain  (using  (2.5)) 

A*  =  -6X11*6.  (3.2) 

What  is  the  largest  possible  steplength  6  that  keeps  the  next  iterate  strictly  feasible? 
Clearly,  x  -j-  Ax  >  0  is  maintained  iff  6II#e  <  e.  As  mentioned  in  (2.12'),  the 
orthogonal  projection  11#  may  increase  some  components  of  e  by  0(y/n).  A  simple 
example  to  illustrate  this  worst  case  is  A  =  (— y/n,  1, . . . ,  1)  6  HRlx^n+1l,  x  —  z  — 
c  —  e  —  (1,...,1)T  E  Rn+1,  y  =  0  and  6  =  n  —  y/n.  These  vectors  satisfy  (2.1) 
with  n  =  1,  i.e.  x,y,z  form  a  center,  and  II/ie  =  ” /n,  1, . . . ,  l)r,  and  hence 
II#e  =  — 2^(y/*i  l,.--,l)r-  The  infinity  norms  of  n^e  and  II#e  are  larger  than 
IMloo  by  a  factor  of  (nearly)  y/n/ 2. 

We  emphasize  that  in  this  situation  the  search  direction,  even  when  secured  by 
a  linesearch,  does  not  allow  a  “long”  step4!  This  is  particularly  interesting  since 
all  interior-point  methods  (also  the  “long-step”  methods)  generate  the  same  search 
direction  at  this  point.  To  guarantee  feasibility  of  x  +  Ax  in  the  worst  case  for  a  fixed 
steplength  S  we  therefore  need  6  <  2/ y/n,  almost  as  in  the  model  algorithm.  Thus 
the  factor  y/n  can  be  interpreted  as  the  “magnification”  of  certain  components  of  a 
vector  during  its  orthogonal  projection,  rather  than  a  result  of  “incompatibility”  of 
the  norms  or  a  result  of  our  inability  to  prove  the  best  possible  bound. 

At  this  point  we  may  ask  for  a  geometrical  interpretation  of  the  orthogonal 
projection  and  of  the  fact  that  it  sometimes  magnifies  certain  components  of  the 
vector  e,  i.e.  that  Ull/vejloo  =  o(V»)  »  i  =  M~.  Given  a  point  x,  y,  z  on  the 
path  of  centers,  it  is  well  known  that  -XII#e  is  the  direction  of  the  tangent  (in 
the  primal  space)  to  x(/t).  (To  see  this,  simply  differentiate  (2.1)  with  respect  to 
H  and  observe  that  the  resulting  equation  is  exactly  (2.2)  with  r  =  -e  and  with 
the  derivatives  x' ,  z'  in  place  of  Ax,  A z.)  Now,  if  some  components  of  II#e  are 
»  1  (the  case  <  -1  is  similar  with  z  in  place  of  x)  then  the  linear  approximation 
x(/z(l  -  £))  as  x  -  6 Ax  to  the  path  of  centers  has  a  short  “trust  region”,  since  even 
for  small  6  (any  6  >  l/||n#e||oo)  the  points  x  —  6 Ax  are  not  feasible,  i.e.  not  near 

4  The  conventional  notion  of  short  and  long  steps  is  somewhat  confusing.  Of  course  one  may 
decrease  the  complementarity  parameter  p  by  a  long  step  at  this  point  (and  this  is  the  general 
understanding  of  a  long  step).  What  this  example  really  shows  is  that  the  variables  z,  y,  z  may 
not  be  able  to  follow  such  a  long  step;  i.e.  a  damped  Newton  step  is  required,  and  the  actual 
complementarity  gap  is  only  reduced  by  a  small  amount. 
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the  path  of  centers.  This  implies  that  the  second  derivative  of  x(p)  must  be  large 
near  pk. 

Similar  considerations  also  hold  for  “non-central”  points  Xz  —  pw  where  w  e 
is  a  positive  weight  vector.  For  estimates  about  the  curvature  of  x(p)  we  refer  to 
[16]. 


3.2.  An  Adaptive  Method 

If  the  path  of  centers  is  “well  behaved”  we  do  not  anticipate  the  worst  case — that 
the  infinity  norm  of  the  residual  is  increased  by  a  factor  of  0(y/n) — in  each  step  (see 
[12]).  The  hope  that  the  estimates  used  in  the  model  algorithm  are  too  pessimistic 
in  the  average  case  suggests  the  following  adaptive  choice  of  the  steplength. 

Let  us  again  take  (3  =  ^  as  in  the  model  algorithm.  Given  xk,yk,zk  and  p  —  pk 
such  that 

we  wish  to  determine  p  :=  pk+1  as  small  as  possible  such  that  the  residual  rk+1 
following  the  Newton  step  for  finding  x(p),y(p),z(p)  satisfies  the  analogous  bound 

||r*+1||2  =  ||Xt+>J‘+,-Ae||I<|,  (3.3) 

where  xk+1  =  xk  +  Ax*  and  z*+1  —  zk  ■ f  A zk  are  obtained  from  xk  and  zk  by  (2.2) 
with  r  =  rk  :=  Xkzk  -  pe.  If  we  set  p  :=  p(l  -  6)  (for  some  S  =  6k  >  0),  then 
it  follows  that  fk  =  rk  +  Spe.  The  component  of  Axk  =  DHpfDX~1fk  resulting 
from  rk  is  often  referred  to  as  the  centering  direction  (it  brings  the  iterate  closer  to 
the  center  x(p)),  while  the  component  resulting  from  6e  is  aimed  at  reducing  the 
duality  gap  and  is  usually  referred  to  as  the  affine  scaling  direction.  By  (2.7)  the 
next  residual  is  given  by  rk+i  =  AXk Azk ,  where 

Axk  =  n  NDX~lfk,  Azk  =  nRDX _1ffc,  (3.4) 

(where  x  =  xk,  z  =  zk,  D 2  =  XZ~l).  If  we  define  qk  =  DX~lrk,  sk  =  DX~xe  and 

Vi  =  II  Nqk,  v2  =  qk-v  i,  v3  =  n^s*,  v4  =  sk-v 3, 

then 

Ax  =  Vi  4-  fv3,  A z  =  v2  +  6v 4. 

This  shows  that  (3.3)  is  equivalent  to 

ll"J,+,ll?  =  ll(V.  +  ^  +  (3.5) 

which  in  turn  is  equivalent  to 

p{6)  :=  oq  +  cl\6  +  a2S^  +  a363  -I-  04^^  <  0,  (3.6) 
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where 


a0  =  vjvl  -  p2/4, 

ax  =  2(uxt>3«2  +  t>2t>4t>i  +  M/4), 

a2  =  v%v$  +  v%v%  +  4wiV2U3V4  -  1/4,  (3.7) 

a3  =  2(«iv3V4  +  v2v4vl), 
a4  =  v%vj, 

and  each  “product”  of  four  vectors  “abed”  stands  for  the  sum  Ylj=i(ajbjcjdj). 

The  maximal  permissible  steplength  6  is  therefore  given  by  the  largest  positive 
zero  of  p  that  is  less  than  one5.  (This  6  =  6k  can  be  computed  in  constant  time.) 
Thus  we  obtain  a  simple  modification  of  the  model  algorithm  by  changing  Step  2  to 

2/.  pk+l:=pk(l  -6k). 

3.3.  Achievable  Gains 

The  analysis  of  the  model  algorithm  in  Section  2.3  has  shown  that  a  steplength  of 
S  —  fi/(4y/n)  is  possible  in  the  worst  case,  i.e.  that  p(S)  <  0  for  S  6  (0,p/(4x/n)). 
We  now  examine  how  long  the  step  might  be  on  average.  We  will  call  a  vector  v 
well  balanced  if 

IMloo  < 

for  some  small  constant  k  >  1.  This  excludes  the  case  that  v  has  only  a  few  large 
components. 

Let  us  observe  first  that  the  vector  sk  is  well  balanced,  since  l/y/2  <  sk  <  y/2 
from  the  estimates  (2.8)  and  (2.9). 

To  find  how  large  we  may  expect  6  to  be  in  the  “best  case”  we  assume  that  the 
projections  V3  =  II NSk  and  =  sk  —  v3  are  also  well  balanced,  which  essentially 
excludes  the  case  that  some  of  the  components  of  sk  are  magnified  by  more  than  a 
constant  factor  during  the  orthogonal  projection  II at.  This  allows  a  tight  bound  on 
the  size  of  the  coefficients  of  p(6)  by  the  following  Lemma. 

Lemma 

1.  If  a  and  b  are  well  balanced,  then  abed  <  ^||a||2!!&||2||cl|2!l^||2- 

2.  If  a  is  well  balanced,  then  abed  <  ■^||a||2||&||2|M|2||d||2- 

Proof 

1.  J2ajbjcjdj  <  max|aj|  max|6;j  <  max  |ay |  max  |67|  ||c||2||d||2  < 

(The  second  inequality  is  the  Cauchy- Schwarz  inequality, 
and  the  third  uses  that  a  and  b  are  well  balanced.) 

Mt  is  easy  to  see  that  if  p  is  nonnegative  in  [0, 1]  we  may  choose  6  =  1  and  thus  find  the  solution 
of  (P)  in  one  step.  Unfortunately,  this  will  not  happen  in  general,  and  so  we  assume  that  p  has  a 
zero  in  (0, 1). 
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2.  Same  as  above,  only  in  the  third  inequality  we  use  ||6||oo  <  II&H2  instead  of 

IMI~  <  #11*11*  ■ 

By  definition  of  u,  and  by  (2.8)  and  (2.10),  it  follows  that  ||t>i||2,  || V2II2  <  vW2 
and  || V3II2,  ||^4 II2  <  Further,  inequality  (2.13)  (with  tq  =  Ax  and  v?  =  Ax) 

states  that  v2v2  <  /x2//34  =  /t2/16.  From  these  estimates  and  the  lemma  we  obtain 

ao  <  <  0,  ai<  2/z(|  +  f  =  (2«  +  \)P, 

02  <  6«2  —  7,  03  <  8/«2-^,  04  <  4«2-^. 

4  /i  /i2 

In  this  case  we  can  verify  that  p(l/(16/Cv^))  <  0,  so  that  in  the  “best  case”  of  the 
adaptive  algorithm  we  may  expect  a  steplength  of  0(1/ \/n),  while  the  worst-case 
bound  of  0(l/y/n)  on  the  steplength  is  maintained  regardless. 

The  factor  y/n  still  exists  because  the  iterates  remain  close  to  the  path  of  centers 
(in  the  2  norm).  If  we  knew  in  advance  that  all  projections  were  well  balanced,  then 
the  whole  analysis  could  be  carried  out  in  terms  of  the  infinity  norm  as  indicated 
earlier,  and  the  rate  of  convergence  would  be  independent  of  n. 

4.  CONCLUSIONS 

The  short-step  interior-point  methods  that  allow  “nice”  polynomial-time  proofs  of 
convergence  for  linear  programming  turn  out  to  be  much  too  slow  for  practical 
algorithms.  Thus  a  number  of  “long-step”  methods  have  been  analyzed  to  date,  most 
of  which  are  aimed  at  proving  the  correctness  of  existing  numerical  implementations; 
see  in  particular  [6]. 

In  Section  3.1,  however,  we  have  presented  a  worst-case  example  in  which  the 
iterates  x,  y,  z  are  not  able  to  “follow”  a  long  step  in  the  reduction  of  the  comple¬ 
mentarity  parameter  /z.  The  possibility  of  such  worst  cases  is  responsible  for  the 
weak  proofs  of  convergence  for  long-step  methods.  These  proofs  not  only  fail  to 
explain  the  fast  convergence  of  the  implementations  that  has  been  observed  for  all 
numerical  examples,  but  also  exhibit  a  worse  theoretical  complexity  than  even  the 
short-step  methods. 

The  adaptive  method  presented  here  is  intended  to  close  the  gap  between  the¬ 
oretical  and  practical  complexity.  If  we  were  able  to  show  that  some  percentage  of 
the  projections  of  the  vectors  sk  are  well  balanced  over  a  sequence  of  iterations,  then 
we  could  indeed  reduce  the  theoretical  complexity.  This  is  the  goal  of  future  work. 
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